Spectral Techniques for Partitioning Planted Random Graphs
نویسنده
چکیده
This my final project paper for the class 18.S996: Algorithmic Aspects of Machine Learning, taught in Fall 2013 by Prof. Ankur Moitra. In this note, I will explain the spectral method for partitioning a graph as used in the following two papers: [1] N. Alon, M. Krivelevich, and B. Sudakov, Finding a large hidden clique in a random graph. [4] F. McSherry, Spectral partitioning of random graphs. In [1], the problem is to recover a planted clique of size c √ n in a random graph G(n, 1/2). That is, we first generate a random graph G(n, 1/2), and then randomly put in a clique of size c √ n, and the algorithmic goal is to recover this clique. In [4], variations and extensions of this planted clique problem are considered, including planted bisection and k-coloring. In each case, the graph is generated by first partitioning the vertices into parts of prescribed size (the algorithm doesn’t know which vertices belong to which partition) and then edges are places between parts and within parts using certain prescribed probabilities. The goal is to recover the original partition given the resulting random graph. In [4], McSherry considers probabilities that depend on n, the size of graph, whereas in [1], only the constant 1/2 probability is considered. These problems are all variations of classical NP-hard optimization problems which are even hard to approximate. However, the planted versions turn out to be more amenable, at least in certain parameter regimes. Note that our goal is to recover the planted object/partition, and not necessarily to solve the optimization problem (e.g., finding the largest clique), although these two goals can coincide (as they do in the case of planted clique, since the largest clique in G(n, 1/2) has size (2 + o(1)) log2 n). I will give only a very rough sketch of the proof ideas, without giving much details or calculations.
منابع مشابه
Spectral Clustering by Recursive Partitioning
In this paper, we analyze the second eigenvector technique of spectral partitioning on the planted partition random graph model, by constructing a recursive algorithm using the second eigenvectors in order to learn the planted partitions. The correctness of our algorithm is not based on the ratio-cut interpretation of the second eigenvector, but exploits instead the stability of the eigenvector...
متن کاملConsistency of Spectral Hypergraph Partitioning under Planted Partition Model
Hypergraph partitioning lies at the heart of a number of problems in machine learning and network sciences. A number of algorithms exist in the literature that extend standard approaches for graph partitioning to the case of hypergraphs. However, theoretical aspects of such methods have seldom received attention in the literature as compared to the extensive studies on the guarantees of graph p...
متن کاملSpectral Clustering by Recursive
In this paper, we analyze the second eigenvector technique of spectral partitioning on the planted partition random graph model, by constructing a recursive algorithm using the second eigenvectors in order to learn the planted partitions. The correctness of our algorithm is not based on the ratio-cut interpretation of the second eigenvector, but exploits instead the stability of the eigenvector...
متن کاملSpectral Partitiong in a Stochastic Block Model
In this lecture, we will perform a crude analysis of the performance of spectral partitioning algorithms in what are called stochastic block models or a planted partition model. The name you choose largely depends on your community and application. As we are especially interested today in partitioning, we will call it the planted partition model. In this model, we build a random graph that has ...
متن کاملSpectral Clustering of Graphs with General Degrees in the Extended Planted Partition Model
In this paper, we examine a spectral clustering algorithm for similarity graphs drawn from a simple random graph model, where nodes are allowed to have varying degrees, and we provide theoretical bounds on its performance. The random graph model we study is the Extended Planted Partition (EPP) model, a variant of the classical planted partition model. The standard approach to spectral clusterin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013